Global Testing under Sparse Alternatives: Anova, Multiple Comparisons and the Higher Criticism1 by Ery Arias-castro,

نویسنده

  • EMMANUEL J. CANDÈS
چکیده

Testing for the significance of a subset of regression coefficients in a linear model, a staple of statistical analysis, goes back at least to the work of Fisher who introduced the analysis of variance (ANOVA). We study this problem under the assumption that the coefficient vector is sparse, a common situation in modern high-dimensional settings. Suppose we have p covariates and that under the alternative, the response only depends upon the order of p1−α of those, 0 ≤ α ≤ 1. Under moderate sparsity levels, that is, 0 ≤ α ≤ 1/2, we show that ANOVA is essentially optimal under some conditions on the design. This is no longer the case under strong sparsity constraints, that is, α > 1/2. In such settings, a multiple comparison procedure is often preferred and we establish its optimality when α ≥ 3/4. However, these two very popular methods are suboptimal, and sometimes powerless, under moderately strong sparsity where 1/2 < α < 3/4. We suggest a method based on the higher criticism that is powerful in the whole range α > 1/2. This optimality property is true for a variety of designs, including the classical (balanced) multi-way designs and more modern “p > n” designs arising in genetics and signal processing. In addition to the standard fixed effects model, we establish similar results for a random effects model where the nonzero coefficients of the regression vector are normally distributed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

GLOBAL TESTING UNDER SPARSE ALTERNATIVES: ANOVA, MULTIPLE COMPARISONS AND THE HIGHER CRITICISM By

Testing for the significance of a subset of regression coefficients in a linear model, a staple of statistical analysis, goes back at least to the work of Fisher who introduced the analysis of variance (ANOVA). We study this problem under the assumption that the coefficient vector is sparse, a common situation in modern high-dimensional settings. Suppose we have p covariates and that under the ...

متن کامل

Global Testing under Sparse Alternatives: ANOVA, Multiple Comparisons and the Higher Criticism

Testing for the significance of a subset of regression coefficients in a linear model, a staple of statistical analysis, goes back at least to the work of Fisher who introduced the analysis of variance (ANOVA). We study this problem under the assumption that the coefficient vector is sparse, a common situation in modern high-dimensional settings. Suppose we have p covariates and that under the ...

متن کامل

To “ Global Testing under Sparse Alternatives : Anova , Multiple Comparisons and the Higher Criticism ”

We prove the results stated in the main paper. We start by providing a brief summary of the notations used in the paper. Set [p] = {1, . . . , p} and for a subset J ⊂ [p], let |J | be its cardinality. Bold upper (resp. lower) case letters denote matrices (resp. vectors), and the same letter not bold represents its coefficients, e.g. aj denotes the jth entry of a. For an n × p matrix A with colu...

متن کامل

Community Detection in Sparse Random Networks

We consider the problem of detecting a tight community in a sparse random network. This is formalized as testing for the existence of a dense random subgraph in a random graph. Under the null hypothesis, the graph is a realization of an Erdös-Rényi graph on N vertices and with connection probability p0; under the alternative, there is an unknown subgraph on n vertices where the connection proba...

متن کامل

Detecting positive correlations in a multivariate sample

ERY ARIAS-CASTRO1, SÉBASTIEN BUBECK2 and GÁBOR LUGOSI3 1Department of Mathematics, University of California, San Diego, La Jolla, CA 92093, USA. E-mail: [email protected] 2Department of Operations Research and Financial Engineering, Princeton University, Princeton, NJ 08544, USA. E-mail: [email protected] 3Department of Economics, Pompeu Fabra University, 08005 Barcelona, Spain. E-mail...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011